IMPROVED INVASION ASSAY 

1. FIELD OF THE INVENTION 

The invention relates to improved methods of detecting and characterizing nucleic 
5 acid sequences and variations in nucleic acid sequences. 

2. BACKGROUND OF THE INVENTION 

Since the development of the Polymerase Chain Reaction (PCR), the demand for 
fast, reliable, and cost-effective tests for the detection of specific nucleic acid sequences 

10 has led to the development of a variety of new assay techniques. One of these, referred to 
as the INVADER™ (a trademark of Third Wave Technologies, Madison, Wisconson) 
Assay, Invasion Cleavage Assay, or Invasion Assay, does not require the use of PCR. 
Invasion assays are highly sensitive and can be used to determine, for example, single- 
base differences of specific nucleotide targets. See, e.g., Lyamichev, V., et ah, Nature 

15 Biotech. 17:292-296 (1999); and Ryan, D., et al, Molecular Diag. 4(2):135-144 (1999), 
each of which is incorporated herein by reference. 

Invasion assays are based on the ability of some enzymes {e.g., endonucleases) to 
cleave nucleotide "flaps," such as those shown in FIG. 1 A, that are formed when an 
oligonucleotide complementary to part of a DNA template overlaps with another 

20 oligonucleotide complementary to another part of the template. Specific enzymes, such 
as FEN 1, cleave flaps formed when the 3' end of an upstream oligonucleotide 
complementary to part of a DNA template overlaps with the 5' end of a downstream 
oligonucleotide complementary to another part of the DNA template. As shown in FIG. 
IB, cleavage by such an enzyme of a downstream oligonucleotide (identified as Probe in 

25 FIG. IB) provides a fragment that corresponds to the overlap between the two 

oligonucleotides complementary to the template. See Lyamichev, V., et al., Nature 
Biotech. 17:292-296 (1999). 

Many endonucleases are thermostable, and can be used near the melt temperatures 
of the probe oligonucleotides they cleave. See Kaiser, M.W., et al.,J. Biol. Chem. 

30 274(30):21387-21394 (1999), and patents and published patent applications referred to 

herein. Most invasion assays take advantage of this characteristic by using a molar excess 
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of a probe oligonucleotide that, at or near its melt temperature, will rapidly associate and 
disassociate with the template DNA. This allows a new, uncleaved probe to replace a 
cleaved probe such that when the experiment is complete, the quantity of cleaved 
fluorescent fragments is significantly larger than the amount of template DNA. See, 
5 Lyamichev, V., et al, Nature Biotech. 17:292-296 (1999). 

FIG. 2 provides a representation of an invasion assay that is disclosed by, for 
example, U.S. Patent No. 5,994,069, which is incorporated herein by reference. In this 
assay, the fragment cleaved from a probe oligonucleotide invades a second structure 
(referred to herein as a "reporter precursor") that contains fluorescent reporter R and a 

10 quencher Q. Association of the probe fragment with the second structure forms a flap that 
is cleaved by enzyme in the mixture. This cleavage breaks the connection between the 
reporter R and quencher Q and provides a fluorescent reporter fragment. 

Theoretically, cleavage of the reporter precursor shown in FIG. 2 can occur only 
after a secondary invader oligonucleotide fragment is created by the primary invasion 

15 reaction. In some cases, however, the primary invader oligonucleotide, which is intended 
to only bind to the template DNA, also binds to some degree to the secondary structure, 
causing cleavage of that structure. This cleavage creates a background signal that must be 
considered when determining the result of the assay. Consequently, invasion assays such 
as are represented by FIG. 2 have been run with a control, i.e., an assay in which template, 

20 or target, DNA is not present. The fluorescent signal measured at the completion of this 
control assay is the signal due to the undesired cleavage of the secondary structure by the 
primary invader oligonucleotide. This control signal is subtracted from that measured at 
the completion of the test assay (i.e., the assay with the template DNA) to provide an 
approximation of the signal due only to an overlap between the invader oligonucleotide 

25 and probe oligonucleotides in the primary invasion reaction. In theory, a positive net 
signal indicates that the invader oligonucleotide and probe oligonucleotide are 
complementary to different parts of the DNA template and overlap. 

The invasion assay is a powerful analytical tool that can be used to supplement, or 
even replace, existing assays such as PGR. It is desirable, however, to increase its speed 

30 and accuracy. It is also desirable to reduce the amount of probe material that must be 
used to provide a useful (e.g. , reliable) result. These and other needs are met by the 
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invention disclosed herein. 

3. SUMMARY OF THE INVENTION 

This invention is directed to an improved invasion assay and to methods of 
5 analyzing invasion assay data. The invention is based, in part, on a discovery that signals 
due to the cleavage of a probe annealed to a template polynucleotide {e.g., DNA or RNA) 
can exhibit behavior over time that is different from the behavior over time of signals 
produced by background reactions {e.g. , reactions that are not dependent on the presence 
of target polynucleotide). For example, the positive signal produced by many invasion 
10 assays {i.e., the signal that indicates an cleavage of a probe oligonucleotide annealed to a 
target polynucleotide) can be fit to a polynomial that is higher in order than the 
polynomial required to characterize background signal. 

It is thus possible to determine the results of an invasion assay in real time or at 
the conclusion of the assay by determining whether the behavior of the signal it produced 
15 over time is different from that of a typical, estimated, theoretical or actually measured 
background signal. 

3.1. BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 A provides a schematic representation of the configurations assumed by 
20 oligonucleotides with an overlap complementary to a DNA target. The 3' end of the 
upstream oligonucleotide (A) or the 5' end of the downstream oligonucleotide (C) can 
each be displaced by the other, with both conformers interchanging through an 
intermediate form (B). Endonucleases such as FEN1 cleave only the flap created by 
displacement of the downstream segment (C). See Lyamichev, V., et ah, Nature Biotech. 
25 17:292-296 (1999). 

FIG. IB provides an example of a target region and the cleavage structures 
resulting form the overlap of 0, 1, 3, 5, and 8 nucleotides and the structure of a probe and 
five invader oligonucleotides. The underlined nucleotides at the 3' end of the invader 
oligonucleotides indicate the extent of the overlap with the probe oligonucleotide. The 
30 labeled arrows above the probe show the cleavage points induced by each invader 

oligonucleotide. The star indicates a fluorescent label. See Lyamichev, V., et al, Nature 
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Biotech. 17:292-296 (1999). 

FIG. 2 provides a schematic representation of a type of invasion assay herein 
referred to as a "cascade invasion assay." 

FIG. 3 shows one positive assay and four negative assays. 
5 FIG. 4 shows idealized invasive cleavage assay response curves. 

FIG. 5 shows the effect of domain transformation on a family of response 
functions simulated with added random noise. 

FIG. 6 shows the second order coefficients obtained from fits of the data shown in 

FIG. 3. 

10 FIG. 7 shows the transformation of experimental responses obtained from the data 

shown in FIG. 3. 

FIG. 8 shows transformed signals (top panel) and the residuals obtained when a 
linear model is fit to signals from wells 5 and 1 9 in the data set of Table 1 . 

15 3.2. DEFINITIONS 

" Nucleobase :" refers to a substituted or unsubstituted nitrogen- containing parent 
heteroaromatic ring of a type that is commonly found in polynucleotides. Typically, but 
not necessarily, the nucleobase is capable of forming Watson-Crick and/or Hoogsteen 
hydrogen bonds with an appropriately complementary nucleobase. Exemplary 

20 nucleobases include, but are not limited to, purines such as 2-aminopurine, 

2,6-diaminopurine, adenine (A), ethenoadenine, N 6 -A 2 -isopentenyladenine (6iA), N 6 - 
A 2 -isopentenyl-2-methylthioadenine (2ms6iA), iV^-methyladenine, guanine (G), 
isoguanine, N 2 -dimethylguanine (dmG), 7-methylguanine (7mG), 2-thiopyrimidine, 
6-thioguanine (6sG) hypoxanthine and O^-methylguanine; 7-deaza-purines such as 

25 7-deazaadenine (7-deaza-A) and 7-deazaguanine (7-deaza-G); pyrimidines such as 
cytosine (C), 5-propynylcytosine, isocytosine, thymine (T), 4-thiothymine (4sT), 
5,6-dihydrothymine, 0 4 -methylthymine, uracil (U), 4-thiouracil (4sU) and 
5,6-dihydrouracil (dihydrouracil; D); indoles such as nitroindole and 4-methylindole; 
pyrroles such as nitropyrrole; nebularine; base (Y); etc. Additional exemplary 

30 nucleobases can be found in Fasman, 1989, Practical Handbook of Biochemistry and 
Molecular Biology, pp. 385-394, CRC Press, Boca Raton, FL, and the references cited 
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therein. Preferred nucleobases are purines, 7-deazapurines and pyrimidines. Particularly 
preferred nucleobases are the normal nucleobases, defined infra, and their common 
analogs, e.g. , 2ms6iA, 6iA, 7-deaza-A, D, 2dmG, 7-deaza-G, 7mG, hypoxanthine, 4sT, 
4sU and Y. 

" Normal Nucleobase :" refers to a nucleobase that is naturally-occurring and 
encoding, i.e., adenine, cytosine, guanine, thymine or uracil. 

" Nucleoside :" refers to a compound consisting of a nucleobase covalently linked, 
typically via a heteroaromatic ring nitrogen, to the CI' carbon of a pentose sugar. Typical 
pentose sugars include, but are not limited to, those pentoses in which one or more of the 
carbon atoms are each independently substituted with one or more of the same or different 
-R, -OR, -NRR or halogen groups, where each R is independently hydrogen, (C r C 6 ) 
alkyl or (C 5 -C 14 ) aryl. The pentose sugar maybe saturated or unsaturated. Exemplary 
pentose sugars include, but are not limited to, ribose, 2'-deoxyribose, 
2'-(C 1 -C 6 )alkoxyribose, 2'-(C 5 -C 14 )aryloxyribose, 2 , ,3'-dideoxyribose, 
2',3'-didehydroribose, 2'-deoxy-3'-haloribose, 2'-deoxy-3'-fluororibose, 2'-deoxy- 
3'-chlororibose, 2'-deoxy-3'-aminoribose, 2'-deoxy-3'-(C 1 -C 6 )alkylribose, 2'-deoxy-3'- 
(C r C 6 )alkoxyribose, 2'-deoxy-3'-(C 5 -C 14 )aryloxyribose, 2',3'-dideoxy-3'-haloribose and 
2',3 l -dideoxy-3'-fluororibose. 

When the nucleobase is a purine or a 7-deazapurine, the pentose sugar is attached 
to the N9 or C8 position of the nucleobase. When the nucleobase is a pyrimidine, the 
pentose sugar is attached to the Nl -position of the nucleobase (see, e.g., Kornberg and 
Baker, 1992, DNA Replication, 2 nd Ed., Freeman, San Francisco), except for 
pseudouridines, in which the pentose sugar is attached to the C5 position of the uracil 
nucleobase. Preferred nucleosides are those in which the nucleobase is a purine, a 7- 
deazapurine, a pyrimidine, a normal nucleobase or a common analog of a normal 
nucleobase and the pentose sugar is any of the exemplary pentose sugars listed above. 

" Normal Nucleoside :" refers to a compound consisting of a normal nucleobase 
covalently linked via the Nl (C, T or U) or N9 (A or G) position of the nucleobase to the 
CI' carbon of ribose or 2'-deoxyribose. 

" Nucleoside Analog :" refers to a nucleoside in which the pentose sugar is replaced 
with a pentose sugar analog. Exemplary pentose sugar analogs include, but are not 
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limited to, substituted or unsubstituted furanoses having more or fewer than 5 ring atoms, 
e.g. , erythroses and hexoses and substituted or unsubstituted 3-6 carbon acyclic sugars. 
One or more of the carbon atoms may be independently substituted with one or more of 
the same or different -R, -OR, -NRR or halogen groups, where each R is independently 
hydrogen, (C r C 6 ) alkyl or (C 5 -C 14 ) aryl. 

" Nucleotide :" refers to a nucleoside in which one or more, typically one, of the 
pentose carbons is substituted with a phosphate ester having the formula: 
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where a is an integer from 0 to 4. Preferably, a is 2 and the phosphate ester is 
attached to the 3'- or 5'-carbon of the pentose. Particularly preferred nucleotides are those 
15 in which the nucleobase is a purine, a 7-deazapurine, a pyrimidine, a normal nucleobase 
or a common analog thereof. 

" Normal Nucleotide :" refers to a normal nucleoside in which the 3'- or 5'-carbon 
of the ribose or 2'-deoxyribose is substituted with a phosphate ester having the formula: 
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where a is an integer from 0 to 2. Preferred normal nucleotides are those in which a is 2 
25 and the phosphate ester is attached to the 5'-carbon of the ribose (NTP) or 2'-deoxyribose 
(dNTP). 

" Nucleotide Analog :" refers to a nucleotide in which the pentose sugar and/or one 
or more of the phosphate esters is replaced with its respective analog. Exemplary pentose 
sugar analogs are those previously described in conjunction with nucleoside analogs. 
30 Exemplary phosphate ester analogs include, but are not limited to, alkylphosphonates, 
methylphosphonates, phosphoramidates, phosphotriesters, phosphorothioates, 
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phosphorodithioates, phosphoroselenoates, phosphorodiselenoates, 

phosphoroanilothioates, phosphoroanilidates, phosphoroamidates, boronophosphates, etc., 
including any associated counterions, if present. 

Also included within the definition of "nucleotide analog" are nucleobase 
monomers which can be polymerized into polynucleotide analogs in which the 
DNA/RNA phosphate ester and/or sugar phosphate ester backbone is replaced with a 
different type of linkage. 

" Nucleoside/tide :" refers to a nucleoside and/or a nucleotide and/or a mixture 

thereof. 

" Polynucleotide :" refers to a linear polymeric chain of nucleoside monomer units 
that are covalently connected to one another by phosphate ester internucleoside linkages. 
Unless stated otherwise, "polynucleotide" as used herein includes polymers of any length, 
including oligonucleotides, polynucleotides and polynucleotides as those terms are 
commonly used in the art. Where polynucleotides of specific size ranges are intended, the 
number of monomer units is specifically delineated. Thus, polynucleotides according to 
the invention can range in size from a few monomer units (e.g., 4 to 40), to several 
hundreds of monomer units, to several thousands of monomer units, or even more 
monomer units. Whenever a polynucleotide is represented by a sequence of letters, e.g. 
"ATGCCTG," it will be understood that the sequence is presented in the 5'^3' direction. 
2'-Deoxyribopolynucleotides are preceded with the letter "d," e.g. "d( ATGCCTG)." 

Polynucleotides may be composed of a single type of sugar moiety, as in the case 
of PvNA and DNA, or mixtures of different sugar moieties, as in the case of RNA/DNA 
chimeras. 

4. DETAILED DESCRIPTION OF THE INVENTION 

This invention is directed to an improved invasion assay that provides several 
advantages when compared to prior assays. These advantages include, but are not limited 
to, less probe necessary to provide a reliable result, less time required to provide a reliable 
result, the ability to determine or estimate assay results in real time (i.e., while data is 
being collected), and the ability to obtain reliable results without running control assays. 
The invention further encompasses a novel method of analyzing data provided by prior 
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invasion assays. In each of the methods of the invention, data collection and/or analysis is 
preferably done by computer. 

This invention is based on the discovery of time-dependent behavior of certain 
invasion assays, which are referred to herein as "cascade assays" or "cascade invasion 
assays." hi cascade assays of this invention, such as those represented in FIG. 2, an 
invader oligonucleotide can hybridize to the target, or template, polynucleotide of interest 
(e.g., DNA or RNA). Invader oligonucleotides can be any oligonucleotide at least a 
portion of which can hybridize to the portion of the template that is of interest. Probes of 
the invention can hybridize with a region of the template that is contiguous and 
overlapping with the region to which the invader oligonucleotide can hybridize or 
otherwise associate. Invader oligonucleotides can contain regions that do not hybridize 
with the template, yet affect the cleavage of probe oligonucleotides. See, e.g., Figure 5 of 
Harrington, J.J., and Lieber, M.R., J. Biol. Chem. 270(9):4503-4508, 4506 (1995), and the 
accompanying text. 

Examples of invader oligonucleotides suitable for use in this invention include, 
but are not limited to, those identified as the "second oligonucleotide," "second oligo," or 
"invader" in U.S. Patent No. 5,994,069 which is incorporated herein by reference. See, 
e.g., U.S. Patent No. 5,994,069 at col. 37, lines 41-48. See also U.S. Patent Nos. 
5,843,669, 5,846,717, 5,874,283, 5,888,780, 5,985,557, and 6,001,567, each of which is 
incorporated herein by reference; and International Patent Application Nos. WO 
97/27214, WO 98/23774, and WO 98/42873, each of which is incorporated herein by 
reference. Examples of probes include, but are not limited to, those identified as the "first 
oligonucleotide," "first oligo," or "probe" in U.S. Patent No. 5,994,069. See, e.g., U.S. 
Patent No. 5,994,069 at col. 37, lines 37-41. See also U.S. Patent Nos. 5,843,669, 
5,846,717, 5,874,283, 5,888,780, 5,985,557, and 6,001,567; and International Patent 
Application Nos. WO 97/27214, WO 98/23774, and WO 98/42873. 

The interaction between the invader oligonucleotide, probe oligonucleotide, and 
template, or target, polynucleotide in the assays of this invention is described, for 
example, in U.S. Patent No. 5,994,069, International Patent Application No. WO 
98/42873, and Harrington, J.J., and Lieber, M.R., J. Biol. Chem. 270(9):4503-4508 
(1995), each of which is incorporated herein by reference. See, e.g., U.S. Patent No. 
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5,994,069 at col. 37, line 30 - col. 41, line 41; and Figure 5 of Harrington, J. J., and Lieber, 
M.R., J. Biol. Chem. 270(9):4503-4508, 4506 (1995), and the accompanying text. 

In a typical assay of the invention, if the invader oligonucleotide and probe 
oligonucleotide both anneal to the target substrate and overlap by at least one nucleotide, 
5 a cleavage structure is formed which comprises a flap of the probe. This flap can be 
cleaved by, for example, an endonuclease. Examples of endonucleases include, but are 
not limited to, those disclosed by U.S. Patent Nos. 5,843,669, 5,846,717, 5,874,283, 
5,888,780, 5,985,557, 5,994,069, and 6,001,567. Particularly preferred endonucleases are 
described, for example, by International Patent Application No. WO 98/23774. When 

10 cleaved, the flap can then anneal to another structure, forcing its cleavage and the 
production of yet another molecule. If desired, this process can continue a chain of 
reactions that will eventually provide a signal detected by the observer. Typically, 
however, the cleaved probe flap formed in the primary reaction induces the cleavage of 
what is referred to herein as a reporter precursor. Typically, the reporter precursor 

15 comprises reporter and quencher moieties, such as are shown in FIG. 2. Examples of 
reporter precursors are the hairpin structures disclosed, for example, by U.S. Patent No. 
5,994,069. See, e.g., U.S. Patent No. 5,994,069 at col. 72, lines 1 1-67. See also U.S. 
Patent Nos. 5,843,669, 5,846,717, 5,874,283, 5,888,780, 5,985,557, and 6,001,567; and 
International Patent Application Nos. WO 97/27214, WO 98/23774, and WO 98/42873. 

20 Cleavage of the reporter precursor provides a reporter that can be detected by 
conventional means such as, but not limited to, fluorescence. 

It has now been discovered that the signal generated by a cascade (i.e., at least 
two) of cleavage reactions typically exhibits different behavior over time than a signal 
generated by different reactions or series of reactions. Reactions that provide a signal, but 

25 are not indicative of the cleavage process of interest (e.g. , the primary reaction shown in 
FIG. 2), are referred to herein as background reactions. In typical cascade invasion assays 
of the invention, the background signal is linear in nature and can be fit to an equation of 
the formula: 

30 S(t) = p,+p 2 t (1) 
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wherein t is time, Pj is the signal at t = 0, and p 2 is the rate of the signal increase. By 
contrast, the signal produced as a result of the primary reaction exhibits quadratic 
behavior, and is best fit to an equation of the formula: 



5 S(t) = Pl +p 2 t + p 3 t 2 (2) 

wherein p 3 is greater than zero. This difference in behavior is shown, for example, in 
FIG. 3. 

Without being limited by theory, it is believed that the different time-dependent 

10 behavior of the background and target signals is due to the cleavage of multiple probes 

that can associate near or at their melting point with a single target or template to which a 
single primary invader oligonucleotide is also associated. The cleavage of each of these 
probes induces the cleavage of a reporter-quencher complex. Thus, over time, the overlap 
of a single primary invader oligonucleotide with multiple probes on a template can yield a 

15 large number of reporter fragments. By contrast, it is believed that the association of a 
single primary invader oligonucleotide with a reporter-quencher complex only produces 
one reporter fragment. 

Whatever the actual cause of the difference in their time-dependent signals, it is 
now possible to differentiate between positive assay signals and background noise by 

20 simply observing the signal provided by an invasion assay as a function of time. For 
many cascade assays, this can be accomplished by simply fitting the data to Equation 2 
and determining whether p 3 is greater than zero. As a practical matter, however, this 
approach has some limitations. 

Theoretically, a linear growth curve is expected to produce a value of zero for the 

25 t 2 coefficient (p 3 ) when modeled using Equation 2. In practice, however, experimental 
noise makes it likely to have a non-zero value. It is therefore preferred that the assay 
conditions (e.g. , the reagents used, the concentrations of the reactants, the specific invader 
oligonucleotides and probe oligonucleotides used, and the temperature) be adjusted to 
minimize background noise. However, another problem in assessing the significance of 

30 the t 2 coefficient is that the value of p 3 is highly influenced by factors such as, but are not 
limited to, reaction kinetics, acquisition time, and the concentrations of reactants. 



-10- 



DCl - 249371 2 



Consequently, the coefficient p 3 can assume any value in a large range of values. A 
preferred embodiment of the invention therefore limits the range of values that can be 
assumed by p 3 . By doing so, the significance of p 3 can be assessed both empirically and 
theoretically in a realistic and predictable manner. 
5 Although various ways in which this can be done will be apparent to those skilled 

in the art, one particular way in which this can be accomplished is by transformation. In 
this particular method, the independent variable, t, and the dependent variable, S(t), of 
Equation 2 are transformed to t* and S*(t*), respectively, such that their minima are each 
equal to zero and their maxima are each equal to unity. This transformation accomplishes 

10 two important goals. First, the transformation space (t*, S*(t*)) consists of three different 
regions: the linear growth functions fall within the first region, the quadratic growth 
functions in which p 3 is positive fall within the second region, and the quadratic growth 
functions in which p 3 is negative fall within the third region. Second, the value for p 3 
associated with growth curves resulting from invasive cleavage is bounded. 

15 Under optimal conditions, the value of p 3 is less than or equal to unity (i.e., p 3 < 

1). A threshold value for p 3 can thus be set independent of the conditions under which the 
invasive assay is run. FIG. 4 shows an idealized linear growth in signal due to 
background (negative) and an idealized quadratic growth in signal due to primary 
invasive cleavage (positive). FIG. 5 shows the effect of domain transformation on a 

20 family of response functions simulated with added random noise. To be specific, the top 
panel of FIG 5 shows three different response functions (red), three different quadratic 
responses with positive curvature (green), and three different quadratic responses with 
negative curvature (yellow). As shown in the top panel, the response function can be in 
any region of the response space depending on factors such as, but not limited to, the 

25 concentration of the target, or template, molecule, and the nature of the background 
signal(s). By contrast, the transformation domain maps the three families of curves to 
distinct regions in the new domain. This is shown in the bottom panel of FIG. 5, wherein 
the three linear response functions are mapped to the same region and appear as one line, 
and the quadratic response functions appear on either side of that line depending on 

30 whether their curvature is positive or negative. The domain transformation of this 
invention thus achieves natural clustering of the response curves, which allows easy 
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classification of the data obtained from the invasion assay. See Example 2, below. 

Specific embodiments of the invention take into consideration the effects 
background (e.g., experimental) noise can have on the certainty of an analysis of assay 
data. For example, the negative region denoted by the red line in FIG. 5 is expected to 
5 have a finite width. The size of the permissible negative region is consequently 

determined in view of experimental and instrumental conditions, as well as the desired 
rate of false positive. For example, after the response function is transformed, the 
analytical signal can be modeled using linear and quadratic models and utilizing the least 
squares criteria, or any other criteria known to those skilled in the art. Referring to 

10 Equation 2, the t 2 term is then examined in view of the width of the negative region. The 
significance of the quadratic growth can then be estimated using the growth model 
attributed to background noise kinetics (e.g., linear growth), as a benchmark or absolute 
standard. In preferred embodiments of this invention, deviation from linearity is a 
measure of significance. This can be determined by, for example, the residuals of an 

15 assumed linear model. 

A specific method of the invention can thus be described in terms of the following 
steps: (i) transformation of the growth curve data to new domain; (ii) the fitting of linear 
and quadratic models to the transformed growth curve data set; (iii) comparison of the t 2 
term (p 3 in Equation 2) to the threshold values set to define the negative region; and (iv) if 

20 the growth curve is found to be in the positive region, the evaluation of the residuals 

obtained from the linear model fit for their significance. This specific embodiment of the 
invention is described in more detail in Example 2, below. 

The improvements provided by this invention can be used with any variant of the 
invasion assay wherein the cleavage of one structure induces the cleavage of at least one 

25 additional structure. Examples of such assays include, but are not limited to, those 

disclosed by U.S. Patent Nos. 5,843,669, 5,874,283, 5,888,780, 5,985,557, 5,994,069, and 
6,001,567, each of which is incorporated herein by reference. In general, however, this 
invention can be applied to any assay wherein the desired signal exhibits different 
behavior as a function of time than that of the background signal(s). Instead of exhibiting 

30 quadratic behavior, for example, such a signal may exhibit behavior best represented by a 
higher order polynomial, such as is represented by Formula 3: 
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S(t) = E Pj t k G = l,2,...;k = 0,l,2,...) 



(3) 



Other mathematical relationships (e.g., exponential) may also best represent the signal of 
interest produced by a cascade assay. At the same time, the time-dependent background 
5 signal of an assay need not be linear: all that is necessary is a time-based behavior that 
can be determined and differentiated from that of the signal of interest. 

In view of the breadth of the invention, and the fact that it can be applied to wide 
array of assays in addition to those currently known in the art, the following Examples are 
not to be construed as limiting the scope of the present invention. 

10 

5. EXAMPLES 

5.1. EXAMPLE 1: DATA ACCUMULATION AND SIMPLE 
ANALYSIS 

Advantages provided by this invention are apparent from the following 

15 experiment, which was performed using materials available from Applied Biosystems 
(Foster City, California) and Third Wave Technologies (Madison, Wisconsin). An ABI 
PRISM 7700, available from Applied Biosystems, was used for data collection. 

Using 100 ng human genomic DNA per test, the following reagents were used in 
the assay of this invention for detecting a C/T polymorphism in the human MFTHR gene: 

20 a "C" Probe Mix containing a flap-plus-probe oligonucleotide specific for the MTHFR C 
allele and the MTHFR-specific invader oligonucleotide; a "T" Probe Mix containing flap- 
plus-probe oligonucleotide specific for the MTHFR T allele and the MTHFR-specific 
invader oligonucleotide; a Signal Probe comprising fluorogenic oligonucleotide labeled 
with FAM reporter and CY3 quencher that is substrate for the secondary invasion 

25 reaction; a Signal Buffer which includes Mg 2+ ; and Cleavase-VIII™, which is a structure- 
dependent endonuclease that performs the cleavage reaction. 

Human genomic DNA from 90 different individuals was analyzed in the 
experiment. The DNA from these individuals had previously been dried down in 7700 
reaction plates, each well containing approximately 100 ng DNA. Each plate has 6 wells 

30 with no template. In one plate, 10 ul diluted "C" Probe Mix was added to each well. In a 
separate plate, 10 ul diluted "T" Probe Mix was added to each well. These plates were 
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incubated at 95 °C for 5 minutes to denature the DNA, then placed on ice. The use of the 
ice incubation, which was not used in prior invasion assays, is designed to better preserve 
the single- stranded nature of the denatured DNA. 

A reaction cocktail was made by mixing Signal Probe, Signal Buffer, Cleavase- 
5 Vin®, and ROX Passive Reference. Five microliters of this reaction cocktail was added 
to each well. After capping the wells, each plate was incubated at 65 °C for 90 minutes in 
the ABI PRISM 7700 with fluorescence data collected throughout. The fluorescence data 
were analyzed by using multicomponenting to determine the contributions of FAM, CY3, 
and ROX. For each well, the FAM signal was divided by the ROX signal and plotted 

10 versus incubation time. 

FIG. 3 shows the results for five wells from the "T" reaction plate. This plot 
shows the results from four negative samples and one positive sample (the curved trace). 
The negative samples show a linear increase in normalized FAM signal. This indicates 
there is background cleavage of the signal probe in the absence of the appropriate 

15 template. The same linear increase is signal is observed in No Template Control 

reactions. For the positive sample, the rate of FAM signal increase is accelerated relative 
to the linear increase seen in the negative samples. In fact, the increase in FAM signal 
observed in positive reactions is fit very well by a quadratic equation. Thus, positive and 
negative reactions can be distinguished by the second order term in the quadratic fit. A 

20 significant second order term indicates a positive reaction, whereas a second order term 
close to zero indicates a negative reaction. This is made clear by FIG. 6, which shows the 
second order coefficients from all the reactions, ranked from lowest to highest. In each 
plot, the vertical line running from the bottom to the top of the chart shows the separation 
of positive and negative samples. The student t-test indicates a 99.5% confidence for the 

25 separation in each case. Based on these separation lines, the genotype of each individual 
can thus be determined from the second order coefficients, as shown in Table 1 . 
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In Table 1, the column marked "TaqMan Calls" reports the genotypes previously 
determined for these individuals using a TaqMan™ assay. As used in the table, "1" 
means C homozygote; "2" means T homozygote; and "1 and 2" means C/T heterozygote. 
The red circle shows the only discrepancy between the INVADER™ and TaqMan™ data. 

5 

5.2. EXAMPLE 2: ALTERNATIVE DATA ANALYSIS 

Comparison of the coefficients determined from a fit of the data obtained in 
Example 1 can be used to evaluate the results of assay. This simple method can provide 
erroneous results, however, since even small quadratic coefficients can indicate a positive 
10 signal, while relatively large ones may simply be artifacts of the fitting process. 

Therefore, it is preferred that analysis of the data, whether performed at the conclusion of 
the experiment or in real time, focus instead on the shape of the function to which the data 
is fit, rather than the coefficients calculated by simply fitting the data to a quadratic 
equation. 

15 Using the transformation method described herein, transformed response functions 

were generated from the data obtained in Example 1 . FIG. 7 shows these transformed 
response functions. The statistical significance of these results were then determined. 

The top panel of FIG. 8 shows the transformed signals from wells 5 (negative) and 
1 9 (positive) of the data described in Example 1. The middle and bottom panels show the 

20 residuals obtained from fitting a linear growth model to the negative and positive profiles, 
respectively. In the case of well 5, where the signal growth is linear, the residuals exhibit 
the expected behavior; they are distributed about zero and cross the "zero line" very 
frequently. By contrast, the residuals from well 19 (where the signal growth is quadratic, 
see the top panel of FIG. 8) cluster on the positive side of zero and the negative side of 

25 zero, and do not cross the zero line as frequently. The larger the deviation from linearity, 
the fewer the number of crossing. Thus, the number of zero crossings in a population of 
residuals obtained after fitting a linear model to a time-varying profile, in this case the 
transformed signal vs. the transformed time, is an indicator of how significantly the 
profile deviates from a straight line. See, e.g., S. Siegel, "Nonparametric Statistics," 

30 McGraw-Hill, New York, 1956, p52; and N. Draper and H. Smith, "Applied Regression 
Analysis," Third Edition, John Wiley & Sons, Inc., New York, 1998, pi 92. 
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The number of crossings was examined in terms of the number of runs in the 
residuals. A ran may be defined as a succession of residuals with the same sign which are 
preceded and followed by residuals of a different sign or by no residuals. See S. Siegel, 
"Nonpararnetric Statistics," McGraw-Hill, New York, 1956, p52. For example the 
5 following set of 10 residuals consisting of {+0.5, +0.1, -0.2, -0.4, -0.5, -0.6, -0.3, -0.2, 
-0.1, and +0.2} is said to contain 3 runs. The number of runs in any given population of 
residuals is an indication of whether or not the residuals are randomly distributed. For a 
large {e.g., greater than about 20) sample of residuals where there are n positive residuals, 
m negative residuals, and r runs, the significance is estimated from the expected 
10 distribution of r whose mean, u r and standard deviation, o\, are given as: 

\i r = 1 + (2nm)/(n + m) (4) 

c r = (A/B)* (5) 



15 



20 



where A = 2nm(2nm - n - m) and B = (n + m) 2 (n + m - 1). A normalized r value, r*, is 
calculated according to: 

r* = (r - u r )/a r (6) 



Equation (6) expresses the number of runs, r, in terms of its deviation from 
expected value, given the sample size, n and m, in units of standard deviations. The 
significance of such deviation is consequently assessed under the null hypothesis that n 
and m occur in random order. The null hypothesis is rejected when r* cannot be 

25 accounted for by random arrangement at a given level of significance. In such cases the 
residuals are deemed systematic and the linear model invalid, i.e., the growth curve is 
non-linear at the desired level of significance. 

The middle and bottom panels of FIG. 8 show the r* values estimated by Equation 
(6) for wells 5 and 19. In the case of well 5, the normalized number of runs is within half 

30 a standard deviation unit from the expected mean; the linear model is, therefore, accepted. 
In the case of well 19, the normalized number of runs is more than 23 standard deviation 



-17- 



DCl -249371 2 



units from the expected mean and the linear model is rejected. While many statistical 
models maybe used, Tchebycheff s model provides a robust, and conservative, approach. 
See, e.g., R. Kirk, "Introductory Statistics," Wadsworth Publishing Company, Inc., 
Belmont, CA, 1978, p83. Tchebycheff s theorem can be summarized as shown in 
5 equation (7): 

P{|x- uj > ko} < 1/k 2 (7) 

According to equation (7), the probability that a random variable is greater than k 
10 units of standard deviation away from its mean is less than 1/k 2 . Consequently, the larger 
the absolute value of r* in equation (6), the higher the probability that the residuals are 
not randomly distributed about zero and the linear model is rejected at the (l-(l/k 2 )) 
significance level (also referred to as "confidence level"). 

The statistical significance test summarized above provides important feedback to 
15 the users vis-a-vis the selected threshold for p 3 and the width of the 'negative region." As 
mentioned above, growth curves with p 3 values that exceed a threshold value are tested 
for significance. When the threshold is set too low, the results classified as positives, 
because the t 2 terms exceeds the threshold value, are expected to have a high number of 
runs, and, thus, low significance level (k in Equation (7)). Users can therefore use the 
20 significance level as a guide in setting the limits of the "negative region." 

The distribution of runs in the residuals enables users to obtain robust and 
sensitive statistical confidence estimates. Because the linear model provides an 
"absolute" standard, unbiased estimation of detection and classification 
confidence/significance can be derived from a single sample. 

25 

The embodiments of the invention described above are intended to be merely 
exemplary, and those skilled in the art will recognize, or will be able to ascertain using no 
more than routine experimentation, numerous equivalents of the specific materials, 
procedures, and devices described herein. All such equivalents are considered to be 
30 within the scope of the invention and are encompassed by the appended claims. 
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